Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 94
Filtrar
1.
Sci Total Environ ; 928: 172592, 2024 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-38642768

RESUMO

Submerged plants affect nitrogen cycling in aquatic ecosystems. However, whether and how submerged plants change nitrous oxide (N2O) production mechanism and emissions flux remains controversial. Current research primarily focuses on the feedback from N2O release to variation of substrate level and microbial communities. It is deficient in connecting the relative contribution of individual N2O production processes (i.e., the N2O partition). Here, we attempted to offer a comprehensive understanding of the N2O mitigation mechanism in aquatic ecosystems on the Changjiang River Delta according to stable isotopic techniques, metagenome-assembly genome analysis, and statistical analysis. We found that the submerged plant reduced 45 % of N2O emissions by slowing down the dissolved inorganic nitrogen conversion velocity to N2O in sediment (Vf-[DIN]sed). It was attributed to changing the N2O partition and suppressing the potential capacity of net N2O production (i.e., nor/nosZ). The dominated production processes showed a shift with increasing excess N2O. Meanwhile, distinct shift thresholds of planted and unplanted habitats reflected different mechanisms of stimulated N2O production. The hotspot zone of N2O production corresponded to high nor/nosZ and unsaturated oxygen (O2) in unplanted habitat. In contrast, planted habitat hotspot has lower nor/nosZ and supersaturated O2. O2 from photosynthesis critically impacted the activities of N2O producers and consumers. In summary, the presence of submerged plants is beneficial to mitigate N2O emissions from aquatic ecosystems.

2.
Brief Bioinform ; 25(3)2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38581420

RESUMO

Protein-ligand interaction prediction presents a significant challenge in drug design. Numerous machine learning and deep learning (DL) models have been developed to accurately identify docking poses of ligands and active compounds against specific targets. However, current models often suffer from inadequate accuracy or lack practical physical significance in their scoring systems. In this research paper, we introduce IGModel, a novel approach that utilizes the geometric information of protein-ligand complexes as input for predicting the root mean square deviation of docking poses and the binding strength (pKd, the negative value of the logarithm of binding affinity) within the same prediction framework. This ensures that the output scores carry intuitive meaning. We extensively evaluate the performance of IGModel on various docking power test sets, including the CASF-2016 benchmark, PDBbind-CrossDocked-Core and DISCO set, consistently achieving state-of-the-art accuracies. Furthermore, we assess IGModel's generalizability and robustness by evaluating it on unbiased test sets and sets containing target structures generated by AlphaFold2. The exceptional performance of IGModel on these sets demonstrates its efficacy. Additionally, we visualize the latent space of protein-ligand interactions encoded by IGModel and conduct interpretability analysis, providing valuable insights. This study presents a novel framework for DL-based prediction of protein-ligand interactions, contributing to the advancement of this field. The IGModel is available at GitHub repository https://github.com/zchwang/IGModel.


Assuntos
Aprendizado Profundo , Proteínas , Proteínas/química , Ligação Proteica , Ligantes , Desenho de Fármacos
3.
Methods ; 224: 35-46, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38373678

RESUMO

Bivalent Smac mimetics have been shown to possess binding affinity and pro-apoptotic activity similar to or more potent than that of native Smac, a protein dimer able to neutralize the anti-apoptotic activity of an inhibitor of caspase enzymes, XIAP, which endows cancer cells with resistance to anticancer drugs. We design five new bivalent Smac mimetics, which are formed by various linkers tethering two diazabicyclic cores being the IAP binding motifs. We built in silico models of the five mimetics by the TwistDock workflow and evaluated their conformational tendency, which suggests that compound 3, whose linker is n-hexylene, possess the highest binding potency among the five. After synthesis of these compounds, their ability in tumour cell growth inhibition and apoptosis induction displayed in experiments with SK-OV-3 and MDA-MB-231 cancer cell lines confirms our prediction. Among the five mimetics, compound 3 displays promising pro-apoptotic activity and deserves further optimization.


Assuntos
Antineoplásicos , Neoplasias , Humanos , Proteínas Inibidoras de Apoptose/metabolismo , Proteínas Inibidoras de Apoptose/farmacologia , Proteínas Inibidoras de Apoptose Ligadas ao Cromossomo X/metabolismo , Proteínas Inibidoras de Apoptose Ligadas ao Cromossomo X/farmacologia , Antineoplásicos/farmacologia , Antineoplásicos/química , Conformação Molecular , Apoptose , Linhagem Celular Tumoral
4.
Int J Mol Sci ; 24(22)2023 Nov 07.
Artigo em Inglês | MEDLINE | ID: mdl-38003217

RESUMO

The automatic detection of cells in microscopy image sequences is a significant task in biomedical research. However, routine microscopy images with cells, which are taken during the process whereby constant division and differentiation occur, are notoriously difficult to detect due to changes in their appearance and number. Recently, convolutional neural network (CNN)-based methods have made significant progress in cell detection and tracking. However, these approaches require many manually annotated data for fully supervised training, which is time-consuming and often requires professional researchers. To alleviate such tiresome and labor-intensive costs, we propose a novel weakly supervised learning cell detection and tracking framework that trains the deep neural network using incomplete initial labels. Our approach uses incomplete cell markers obtained from fluorescent images for initial training on the Induced Pluripotent Stem (iPS) cell dataset, which is rarely studied for cell detection and tracking. During training, the incomplete initial labels were updated iteratively by combining detection and tracking results to obtain a model with better robustness. Our method was evaluated using two fields of the iPS cell dataset, along with the cell detection accuracy (DET) evaluation metric from the Cell Tracking Challenge (CTC) initiative, and it achieved 0.862 and 0.924 DET, respectively. The transferability of the developed model was tested using the public dataset FluoN2DH-GOWT1, which was taken from CTC; this contains two datasets with reference annotations. We randomly removed parts of the annotations in each labeled data to simulate the initial annotations on the public dataset. After training the model on the two datasets, with labels that comprise 10% cell markers, the DET improved from 0.130 to 0.903 and 0.116 to 0.877. When trained with labels that comprise 60% cell markers, the performance was better than the model trained using the supervised learning method. This outcome indicates that the model's performance improved as the quality of the labels used for training increased.


Assuntos
Redes Neurais de Computação , Aprendizado de Máquina Supervisionado , Processamento de Imagem Assistida por Computador/métodos
5.
Brief Bioinform ; 24(6)2023 09 22.
Artigo em Inglês | MEDLINE | ID: mdl-37833842

RESUMO

Recent studies have shed light on the potential of circular RNA (circRNA) as a biomarker for disease diagnosis and as a nucleic acid vaccine. The exploration of these functionalities requires correct circRNA full-length sequences; however, existing assembly tools can only correctly assemble some circRNAs, and their performance can be further improved. Here, we introduce a novel feature known as the junction contig (JC), which is an extension of the back-splice junction (BSJ). Leveraging the strengths of both BSJ and JC, we present a novel method called JCcirc (https://github.com/cbbzhang/JCcirc). It enables efficient reconstruction of all types of circRNA full-length sequences and their alternative isoforms using splice graphs and fragment coverage. Our findings demonstrate the superiority of JCcirc over existing methods on human simulation datasets, and its average F1 score surpasses CircAST by 0.40 and outperforms both CIRI-full and circRNAfull by 0.13. For circRNAs below 400 bp, 400-800 bp, 800 bp-1200 bp and above 1200 bp, the correct assembly rates are 0.13, 0.09, 0.04 and 0.03 higher, respectively, than those achieved by existing methods. Moreover, JCcirc also outperforms existing assembly tools on other five model species datasets and real sequencing datasets. These results show that JCcirc is a robust tool for accurately assembling circRNA full-length sequences, laying the foundation for the functional analysis of circRNAs.


Assuntos
RNA Circular , RNA , Humanos , RNA Circular/genética , Análise de Sequência de RNA/métodos , Isoformas de Proteínas/genética , RNA/genética
6.
J Comput Biol ; 30(9): 951-960, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-37585615

RESUMO

Spiking neural network (SNN) simulators play an important role in neural system modeling and brain function research. They can help scientists reproduce and explore neuronal activities in brain regions, neuroscience, brain-like computing, and other fields and can also be applied to artificial intelligence, machine learning, and other fields. At present, many simulators using central processing unit (CPU) or graphics processing unit (GPU) have been developed. However, due to the randomness of connections between neurons and spiking events in SNN simulation, this causes a lot of memory access time. To alleviate this problem, we developed an SNN simulator SWsnn based on the new Sunway SW26010pro processor. The SW26010pro processor consists of six core groups, each with 16 MB of local data memory (LDM). LDM has the characteristics of high-speed read and write, which is suitable for performing simulation tasks similar to SNNs. Experimental results show that SWsnn runs faster than other mainstream GPU-based simulators when simulating a certain scale of neural network, showing a strong performance advantage. To conduct larger scale simulations, SWsnn designed a simulation computation based on a large shared model of Sunway processor and developed a multiprocessor version of SWsnn based on this mode, achieving larger scale SNN simulations.


Assuntos
Inteligência Artificial , Redes Neurais de Computação , Simulação por Computador , Neurônios/fisiologia , Encéfalo
7.
Proteins ; 91(12): 1837-1849, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37606194

RESUMO

We introduce a deep learning-based ligand pose scoring model called zPoseScore for predicting protein-ligand complexes in the 15th Critical Assessment of Protein Structure Prediction (CASP15). Our contributions are threefold: first, we generate six training and evaluation data sets by employing advanced data augmentation and sampling methods. Second, we redesign the "zFormer" module, inspired by AlphaFold2's Evoformer, to efficiently describe protein-ligand interactions. This module enables the extraction of protein-ligand paired features that lead to accurate predictions. Finally, we develop the zPoseScore framework with zFormer for scoring and ranking ligand poses, allowing for atomic-level protein-ligand feature encoding and fusion to output refined ligand poses and ligand per-atom deviations. Our results demonstrate excellent performance on various testing data sets, achieving Pearson's correlation R = 0.783 and 0.659 for ranking docking decoys generated based on experimental and predicted protein structures of CASF-2016 protein-ligand complexes. Additionally, we obtain an averaged local distance difference test (lDDT pli = 0.558) of AIchemy LIG2 in CASP15 for de novo protein-ligand complex structure predictions. Detailed analysis shows that accurate ligand binding site prediction and side-chain orientation are crucial for achieving better prediction performance. Our proposed model is one of the most accurate protein-ligand pose prediction models and could serve as a valuable tool in small molecule drug discovery.


Assuntos
Proteínas , Ligantes , Ligação Proteica , Proteínas/química , Sítios de Ligação , Simulação de Acoplamento Molecular
9.
Front Genet ; 14: 1248519, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37485341

RESUMO

[This corrects the article DOI: 10.3389/fgene.2022.816825.].

10.
Front Mol Biosci ; 10: 1249019, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37469706

RESUMO

[This corrects the article DOI: 10.3389/fmolb.2022.857320.].

11.
Methods ; 216: 39-50, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37330158

RESUMO

Assessing the quality of sequencing data plays a crucial role in downstream data analysis. However, existing tools often achieve sub-optimal efficiency, especially when dealing with compressed files or performing complicated quality control operations such as over-representation analysis and error correction. We present RabbitQCPlus, an ultra-efficient quality control tool for modern multi-core systems. RabbitQCPlus uses vectorization, memory copy reduction, parallel (de)compression, and optimized data structures to achieve substantial performance gains. It is 1.1 to 5.4 times faster when performing basic quality control operations compared to state-of-the-art applications yet requires fewer compute resources. Moreover, RabbitQCPlus is at least 4 times faster than other applications when processing gzip-compressed FASTQ files and 1.3 times faster with the error correction module turned on. Furthermore, it takes less than 4 minutes to process 280 GB of plain FASTQ sequencing data, while other applications take at least 22 minutes on a 48-core server when enabling the per-read over-representation analysis. C++ sources are available at https://github.com/RabbitBio/RabbitQCPlus.


Assuntos
Compressão de Dados , Software , Sequenciamento de Nucleotídeos em Larga Escala , Controle de Qualidade , Algoritmos , Análise de Sequência de DNA
12.
Genome Biol ; 24(1): 121, 2023 05 17.
Artigo em Inglês | MEDLINE | ID: mdl-37198663

RESUMO

We present RabbitTClust, a fast and memory-efficient genome clustering tool based on sketch-based distance estimation. Our approach enables efficient processing of large-scale datasets by combining dimensionality reduction techniques with streaming and parallelization on modern multi-core platforms. 113,674 complete bacterial genome sequences from RefSeq, 455 GB in FASTA format, can be clustered within less than 6 min and 1,009,738 GenBank assembled bacterial genomes, 4.0 TB in FASTA format, within only 34 min on a 128-core workstation. Our results further identify 1269 redundant genomes, with identical nucleotide content, in the RefSeq bacterial genomes database.


Assuntos
Genoma , Software , Bases de Dados de Ácidos Nucleicos , Análise por Conglomerados , Bactérias , Algoritmos , Genoma Bacteriano
13.
Environ Res ; 227: 115710, 2023 06 15.
Artigo em Inglês | MEDLINE | ID: mdl-36933634

RESUMO

Vegetation restoration projects can not only improve water quality by absorbing and transferring pollutants and nutrients from non-vegetation sources, but also protect biodiversity by providing habitat for biological growth. However, the mechanism of the protistan and bacterial assembly processes in the vegetation restoration project were rarely explored. To address this, based on 18 S rRNA and 16 S rRNA high-throughput sequencing, we investigated the mechanism of protistan and bacterial community assembly processes, environmental conditions, and microbial interactions in the rivers with (out) vegetation restoration. The results indicated that the deterministic process dominated the protistan and bacterial community assembly (94.29% and 92.38%), influenced by biotic and abiotic factors. For biotic factors, microbial network connectivity was higher in the vegetation zone (average degree = 20.34) than in the bare zone (average degree = 11.00). For abiotic factors, the concentration of dissolved organic carbon ([DOC]) was the most important environmental factor affecting the microbial community composition. [DOC] was lower significantly in vegetation zone (18.65 ± 6.34 mg/L) than in the bare zone (28.22 ± 4.82 mg/L). In overlying water, vegetation restoration upregulated the protein-like fluorescence components (C1 and C2) by 1.26 and 1.01-folds and downregulated the terrestrial humic-like fluorescence components (C3 and C4) by 0.54 and 0.55-folds, respectively. The different DOM components guided bacteria and protists to select different interactive relationships. The protein-like DOM components led to bacterial competition, whereas the humus-like DOM components resulted in protistan competition. Finally, the structural equation model was established to explain that DOM components can affect protistan and bacterial diversity by providing substrates, facilitating microbial interactions, and promoting nutrient input. In general, our study provides insights into the responses of vegetation restored ecosystems to the dynamics and interactives in the anthropogenically influenced river and evaluates the ecological restoration performance of vegetation restoration from a molecular biology perspective.


Assuntos
Matéria Orgânica Dissolvida , Microbiota , Rios/química , Qualidade da Água , Bactérias/genética , Espectrometria de Fluorescência
14.
J Chem Inf Model ; 63(3): 835-845, 2023 02 13.
Artigo em Inglês | MEDLINE | ID: mdl-36724090

RESUMO

Many bioactive peptides demonstrated therapeutic effects over complicated diseases, such as antiviral, antibacterial, anticancer, etc. It is possible to generate a large number of potentially bioactive peptides using deep learning in a manner analogous to the generation of de novo chemical compounds using the acquired bioactive peptides as a training set. Such generative techniques would be significant for drug development since peptides are much easier and cheaper to synthesize than compounds. Despite the limited availability of deep learning-based peptide-generating models, we have built an LSTM model (called LSTM_Pep) to generate de novo peptides and fine-tuned the model to generate de novo peptides with specific prospective therapeutic benefits. Remarkably, the Antimicrobial Peptide Database has been effectively utilized to generate various kinds of potential active de novo peptides. We proposed a pipeline for screening those generated peptides for a given target and used the main protease of SARS-COV-2 as a proof-of-concept. Moreover, we have developed a deep learning-based protein-peptide prediction model (DeepPep) for rapid screening of the generated peptides for the given targets. Together with the generating model, we have demonstrated that iteratively fine-tuning training, generating, and screening peptides for higher-predicted binding affinity peptides can be achieved. Our work sheds light on developing deep learning-based methods and pipelines to effectively generate and obtain bioactive peptides with a specific therapeutic effect and showcases how artificial intelligence can help discover de novo bioactive peptides that can bind to a particular target.


Assuntos
COVID-19 , Aprendizado Profundo , Humanos , Inteligência Artificial , Desenho de Fármacos , SARS-CoV-2 , Peptídeos/farmacologia
15.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36502369

RESUMO

The recently reported machine learning- or deep learning-based scoring functions (SFs) have shown exciting performance in predicting protein-ligand binding affinities with fruitful application prospects. However, the differentiation between highly similar ligand conformations, including the native binding pose (the global energy minimum state), remains challenging that could greatly enhance the docking. In this work, we propose a fully differentiable, end-to-end framework for ligand pose optimization based on a hybrid SF called DeepRMSD+Vina combined with a multi-layer perceptron (DeepRMSD) and the traditional AutoDock Vina SF. The DeepRMSD+Vina, which combines (1) the root mean square deviation (RMSD) of the docking pose with respect to the native pose and (2) the AutoDock Vina score, is fully differentiable; thus is capable of optimizing the ligand binding pose to the energy-lowest conformation. Evaluated by the CASF-2016 docking power dataset, the DeepRMSD+Vina reaches a success rate of 94.4%, which outperforms most reported SFs to date. We evaluated the ligand conformation optimization framework in practical molecular docking scenarios (redocking and cross-docking tasks), revealing the high potentialities of this framework in drug design and discovery. Structural analysis shows that this framework has the ability to identify key physical interactions in protein-ligand binding, such as hydrogen-bonding. Our work provides a paradigm for optimizing ligand conformations based on deep learning algorithms. The DeepRMSD+Vina model and the optimization framework are available at GitHub repository https://github.com/zchwang/DeepRMSD-Vina_Optimization.


Assuntos
Aprendizado Profundo , Ligantes , Simulação de Acoplamento Molecular , Proteínas/química , Desenho de Fármacos , Ligação Proteica
16.
IEEE/ACM Trans Comput Biol Bioinform ; 20(3): 2341-2348, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36327193

RESUMO

The continuous growth of generated sequencing data leads to the development of a variety of associated bioinformatics tools. However, many of them are not able to fully exploit the resources of modern multi-core systems since they are bottlenecked by parsing files leading to slow execution times. This motivates the design of an efficient method for parsing sequencing data that can exploit the power of modern hardware, especially for modern CPUs with fast storage devices. We have developed RabbitFX, a fast, efficient, and easy-to-use framework for processing biological sequencing data on modern multi-core platforms. It can efficiently read FASTA and FASTQ files by combining a lightweight parsing method by means of an optimized formatting implementation. Furthermore, we provide user-friendly and modularized C++ APIs that can be easily integrated into applications in order to increase their file parsing speed. As proof-of-concept, we have integrated RabbitFX into three I/O-intensive applications: fastp, Ktrim, and Mash. Our evaluation shows that the inclusion of RabbitFX leads to speedups of at least 11.6 (6.6), 2.4 (2.4), and 3.7 (3.2) compared to the original versions on plain (gzip-compressed) files, respectively. These case studies demonstrate that RabbitFX can be easily integrated into a variety of NGS analysis tools to significantly reduce associated runtimes. It is open source software available at https://github.com/RabbitBio/RabbitFX.


Assuntos
Biologia Computacional , Software , Sequenciamento de Nucleotídeos em Larga Escala
17.
Comput Struct Biotechnol J ; 20: 6149-6162, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36420153

RESUMO

The etiology of neuropsychiatric disorders involves complex biological processes at different omics layers, such as genomics, transcriptomics, epigenetics, proteomics, and metabolomics. The advent of high-throughput technology, as well as the availability of large open-source datasets, has ushered in a new era in system biology, necessitating the integration of various types of omics data. The complexity of biological mechanisms, the limitations of integrative strategies, and the heterogeneity of multi-omics data have all presented significant challenges to computational scientists. In comparison to early and late integration, intermediate integration may transform each data type into appropriate intermediate representations using various data transformation techniques, allowing it to capture more complementary information contained in each omics and highlight new interactions across omics layers. Here, we reviewed multi-modal intermediate integrative techniques based on component analysis, matrix factorization, similarity network, multiple kernel learning, Bayesian network, artificial neural networks, and graph transformation, as well as their applications in neuropsychiatric domains. We depicted advancements in these approaches and compared the strengths and weaknesses of each method examined. We believe that our findings will aid researchers in their understanding of the transformation and integration of multi-omics data in neuropsychiatric disorders.

19.
Nanoscale Adv ; 4(11): 2399-2411, 2022 May 31.
Artigo em Inglês | MEDLINE | ID: mdl-36134127

RESUMO

Organic ultrathin semiconductor nanostructures have attracted continuous attention in recent years owing to their excellent charge transport capability, favorable flexibility, solution-processability and adjustable photoelectric properties, providing opportunities for next-generation optoelectronic applications. For integrated electronics, organic ultrathin nanostructures need to be prepared as large-area patterns with precise alignment and high crystallinity to achieve organic electronic devices with high performance and high throughput. However, the fabrication of organic ultrathin nanostructure arrays still remains challenging due to uncontrollable growth along the height direction in solution processes. In this review, we first introduce the properties, assembly methods and applications of four typical organic ultrathin nanostructures, including small molecules, polymers, and other organic-inorganic hybrid materials. Five categories of representative solution-processing techniques for patterning organic micro- and nanostructures are summarized and discussed. Finally, challenges and perspectives in the controllable preparation of organic ultrathin arrays and potential applications are featured on the basis of their current development.

20.
Cell Death Dis ; 13(9): 827, 2022 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-36167685

RESUMO

Circular RNAs (circRNAs) have been reported to play essential roles in tumorigenesis and progression. This study aimed to identify dysregulated circRNAs in gastric cancer (GC) and investigate the functions and underlying mechanism of these circRNAs in GC development. Here, we identify circ_CEA, a circRNA derived from the back-splicing of CEA cell adhesion molecule 5 (CEA) gene, as a novel oncogenic driver of GC. Circ_CEA is significantly upregulated in GC tissues and cell lines. Circ_CEA knockdown suppresses GC progression, and enhances stress-induced apoptosis in vitro and in vivo. Mechanistically, circ_CEA interacts with p53 and cyclin-dependent kinases 1 (CDK1) proteins. It serves as a scaffold to enhance the association between p53 and CDK1. As a result, circ_CEA promotes CDK1-mediated p53 phosphorylation at Ser315, then decreases p53 nuclear retention and suppresses its activity, leading to the downregulation of p53 target genes associated with apoptosis. These findings suggest that circ_CEA protects GC cells from stress-induced apoptosis, via acting as a protein scaffold and interacting with p53 and CDK1 proteins. Combinational therapy of targeting circ_CEA and chemo-drug caused more cell apoptosis, decreased tumor volume and alleviated side effect induced by chemo-drug. Therefore, targeting circ_CEA might present a novel treatment strategy for GC.


Assuntos
MicroRNAs , Neoplasias Gástricas , Apoptose/genética , Antígeno Carcinoembrionário/genética , Antígeno Carcinoembrionário/metabolismo , Linhagem Celular Tumoral , Proliferação de Células , Regulação Neoplásica da Expressão Gênica , Humanos , MicroRNAs/genética , RNA Circular/genética , Neoplasias Gástricas/patologia , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...